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INSECTICIDAL PROTEINS 

The present invention relates to novel 
insecticidal proteins and uses thereof. 

The organism Bacillus thurinaiensis produces a 
crystal -associated protein 6 -endotoxin which kills 
insect larvae upon ingestion. It is not, however, 
toxic to mammals. It is thus very useful as an 
agricultural insecticide, in particular against 
Lepidoptera, Coleoptera and Diptera. Strains of 
Bacillus thurinqiensis have been used as 
agricultural insecticides for a number of decades. 

B thurinqiensis 6 -endotoxins include those 
specifically insecticidal to lepidopteran larvae 
(such as the Cryl type proteins: Hoefte and 
Whiteley, 1989, Microbiol. Rev., 53:242-255), those 
specifically insecticidal to coleopteran larvae 
(such as the Cry II I type proteins: Krieg et al, 
1893, J. Appl. Entomol., 96:500-508) and those with 
dual specificity for lepidoptera and coleoptera 
(such as the CryV protein: International patent 
application publication number WO90/13651; European 
patent application publication number 474662; 
Tailor et al, 1992, Molec. Micro,, 6:1211-1217). 
Plant transformation constructs encoding such 
proteins have been used to produce insect resistant 
transgenic plants (Ely, 1993, in Bacillus 
thurinqiensis , An Environmental Biopesticide : 
Theory and Practice, eds . Entwistle et al, pp 
105-124) . 

B thurinqiensis 6 -endotoxins are large, fairly 
insoluble proteins which are produced as parasporal 
crystals, and which often aggregate when in 
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solution. The lepidopteran-active Cryl type 
6 -endotoxins are synthesized as precursor molecules 
which are "activated" by trypsin- like cleavage in 
the larval midgut. These activated core endotoxins 
are then resistant to further degredation by insect 
gut enzymes. After ingestion, the 6-endotoxins 
bind to larval midgut epithelium; the ahelical N- 
terminal domain of the protein is then thought to 
penetrate the epithelial cell membrane. At this 
stage 6 -endotoxin binding is irreversible and 
midgut integrity is destroyed, eventually leading 
to osmotic disequilibrium and cessation of feeding. 
Because this is a multistep process, larval death 
is relatively slow and may take from three to seven 
days . 

B thurinoiensis 6-endotoxins have been shown 
to retain insecticidal activity when in 5' 
translational fusion to "marker" proteins such as 
neomycin phosphotransferase (Hoefte, 1988, FEBS 
Lett., 226:364-370) and to be correctly processed 
(Honee, 1990, Appl. Env. Micro., 56:823-825). 

Attempts have been made to extend the host 
range of B thurinaiensis 6-endotoxins, making them 
active against a wider range of insects. For 
example, International patent application 
publication number W09117254-A (University of 
California, equivalent to US patent numbers 5143 905 
and 53 06628) describes a chimeric protein 
comprising a first protein segment having 
insecticidal activity and a second protein segment 
(target ting protein) capable of binding strongly to 
the insect gut epithelium. The insecticidal 
protein is preferably a 6-endotoxin (Cry protein) 
of Bacillus thurinaiensis . The targetting protein 


wo 95/11305 


PCT/GB94/02274 


3 

is a viral protein (such as gp64 glycoprotein of 
Autoarapha California Nuclear Polyhedrosis Virus) 
or a bacterial protein having high affinity for the 
lipid components of membranes. The Bacillus 
thurinaiensis 6 -endotoxins (Cry proteins) are not 
used as targetting proteins. W09117254-A further 
describes DNA comprising a first DNA fragment 
encoding an insecticidal protein domain and a 
second DNA fragment encoding a targetting protein 
domain. Such chimeric prpteins can be expressed in 
transgenic plants. 

European patent application number 340948 
(Mycogen Corp, equivalent to US patent number 
5290914) describes a hybrid pesticide protein 
toxin. The hybrid toxin comprises a cytotoxic 
agent and a gut epithelial cell recognition portion 
of a Bacillus thurinaiensis protein. The Bacillus 
thurinaiensis protein portion is located at the 
C- terminal end of the hybrid protein. The 
cytotoxic agent is specifically a ribosome- 
inactivating enzyme (preferably derived from plant 
seeds) or an ADP-ribosylating enzyme (such as 
diphtheria toxin) . Such cytotoxic agents may be 
highly toxic to mammals . 

Various taxonomic groups use "venoms" to 
immobilize prey. These groups include spiders, 
scorpions, snakes (Quistad et al, 1992, J. Econ. 
Entomol., 85:33-39), mites (Tomalski and Miller, 
1991, Nature, 352:82-85), marine worms (Bon et al, 
1985, Neurochem. Int., 7:63-75) and others . Venoms 
are herein defined as biologically active 
secretions comprising proteinaceous components; 
non-proteinaceous components may also be present in 
such secretions . The components of many venoms 
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include small proteins which are insecticidal . For 
example, the tox34 family of related proteins is 
obtainable from female mites of the species 
Py emotes tritici. Certain venom-derived 
insecticidal proteins have been shovm to be highly 
specific for insect sub-cellular structures such as 
sodium channels (McCutchen et al, 1991, 
Bio/Technology, 9:848-852), and to be non-toxic to 
mice (Stewart et al, 1991, Nature, 352:85-88; 
Tomalski and Miller, 1991, Nature, 352:82-85). 
Therefore these venom-derived insecticidal proteins 
are potentially useful as agricultural 
insecticides. For example, European patent 
application publication number 431829 (Agracetus 
Inc, equivalent to US patent number 5177308) 
describes insecticidal proteins produced by insect 
predatory Arthropods, including the insect-specific 
neurotoxin AaHIT from the scorpion Androc tonus 
australis Hector, the toxins BelTl and BeIT2 from 
the scorpion Buthus epeus, agatoxins from the 
spider Aaelenopsis aperta . and a neurotoxin from 
the spider Sestria florentina. Such toxins may be 
encoded by a genetic construct for expression in 
various transgenic host organisms, including 
microorganisms and plants. 

European patent application 431829 (Agracetus) 
also describes transgenic plants containing two 
separate genetic constructs, each encoding a 
separate insecticidal toxin of different toxicity. 
For example, the first construct may encode a 
Bacillus thurinaiensis 6 -endotoxin while the second 
construct encodes an Arthropod- derived insecticidal 
protein. The presence of these two different 
toxins in the same plant' may delay development of 
resistance by susceptible insects. 
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Venoms are normally delivered by injection; 
they are either inactive or of low insecticidal 
activity if ingested (Agracetus, European Patent 
Application 431829), presumably because the venom's 
components are degraded by insect gut digestive 
enzymes. This severely limits the usefulness of 
venom-derived proteins as insecticides. 

It is one object of this invention to provide 
an improved method to deliver a venom-derived 
insecticidal protein to the insect gut. 

According to the present invention we provide 
a chimeric protein comprising an N- terminal portion 
fused to a C-terminal portion, the N-terminal 
portion being at least part of a Bacillus 
thurinqiensis 6-endotoxin including the a-helical 
N-terminal domain and the gut epithelial cell 
recognition domain and the C-terminal portion being 
a venom-derived insecticidal protein. 

The N-terminal portion and the C-terminal 
portion are both obtainable from natural sources. 
The B thurinqiensis 6-endotoxin portion of the 
chimeric protein may be insecticidally active. The 
venom-derived insecticidal protein may be any 
protein or polypeptide which is normally 
insecticidal when injected but inactive or less 
active when ingested by an insect. A variety of 
suitable small, pharmacologically-active, insect - 
selective toxophores are obtainable from venoms of 
various taxonomic groups, including Arthropods such 
as scorpions and spiders, mites, snakes, marine 
worms and others. For example, venom-derived 
insecticidal proteins include the tox34 proteins 
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obtainable from female mites of the species 
Pvemotes tritici , the insect -specif ic neurotoxin 
AalT from the scorpion Androctonus australis 
Hector, the toxins BelTl and BeIT2 from the 
scorpion Buthus eoeus . agatoxins from the spider 
Aaelenopsis a perta . and a neurotoxin from the 
spider Sestria f lorentina > Being insect -specific 
toxophores, such venom-derived insecticidal 
proteins have no known mammalian toxicity and may 
therefore be used more easily for agricultural 
purposes . 

The two separate portions of the chimeric 
protein (the 6 -endotoxin and the venom-derived 
protein) are encoded by a single recombinant gene 
and are bound efficiently together into one fusion 
protein during protein synthesis in the chosen 
transformed host organism. The 6 -endotoxin portion 
is not merely acting as an insecticidal protein but 
is used to enfold and protect the venom-derived 
insecticidal protein, preventing its break-down 
during ingestion. Furthermore, fusion to the 
6 -endotoxin portion delivers the venom- derived 
protein to the insect gut. The gut epithelial cell 
recognition domain of the 6 -endotoxin portion 
allows the chimeric protein to attach to the gut 
wall (such that the chimeric protein has the normal 
host range of the separate 6 -endotoxin) . The 
a-helical N-terminal domain then causes breakdown 
of the gut wall. Cleavage may then release the 
venom-derived insecticidal protein which may pass 
through the disintegrating insect gut wall. Thus 
the chimeric protein has an advantage over the 
separate venom-derived protein as its insecticidal 
activity will be maintained when ingested by an 
insect. The chimeric protein may also show 
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improved activity over the separate Bacillus 
thurinaiensis 6 -endotoxin as the speed of larval 
death is increased. 

The invention further provides a DNA construct 
capable of expressing a chimeric protein comprising 
a DNA sequence encoding at least part of a Bacillus 
thurinaiensis 6-endotoxin including the Of-helical 
N- terminal domain and the gut epithelial cell 
recognition domain in 5 ' -translational fusion to a 
venom-derived insecticidal protein. The 
orientation of the 6-endotoxin portion with respect 
to the venom-derived protein is essential: the 
6-endotoxin comprises the 5' end of the construct 
and the venom-derived protein comprises the 3' end 
of the construct , 

The chimeric fusion protein is encoded by one 
DNA construct comprising a chimeric gene in the 
correct orientation to regulatory elements (such as 
promoters, ribosome- binding sites, termination 
sequences, introns) appropriate for the chosen 
transgenic host (microorganism or plant) . 

E coli strains carrying cloned genes for 
chimeric translational fusion proteins may be 
prepared by growing cells to stationary phase on 
solid nutrient media (such as L agar) prior to 
scraping cell growth from the agar surface, 
lyophilising, and freezing for storage. This 
material may be tested directly for insecticidal 
efficacy if appropriate control strains (such as 
the same E coli carrying only the cloning vector) 
are included in the insect bioassays . 
Alternatively, the chimeric genes may be cloned in 
an E coli vector that facilitates purification of 
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the cloned fusion protein by affinity 
chromatography. E coli cloning vectors and 
affinity purification methods and materials are 
commerically available (for example , those from New 
England Biolabs, Beverly, MA) . Expression may also 
be carried out using known methods in eukaryotic 
systems such as yeast, or plants. Pure fusion 
proteins prepared from any of these expression 
systems may be used in insecticidal compositions by 
the addition of any desired and appropriate 
formulating agents. Formulating agents which may 
be useful include, for example, surface active 
agents (such as wetting agents) , solid diluents, 
dispersing agents and UV stabilisers. If desired, 
solid formulations may be prepared by known 
methods . 

The invention further provides a method to 
combat insects by exposing them to a chimeric 
protein according to the invention. 

In the chimeric proteins of the invention, 
orally- active B thuringiensis 6 -endotoxins are used 
as delivery molecules for venom-derived 
insecticidal proteins normally delivered by 
injection. Thus non- Bacillus thuringiensis 
toxophores may be effectively transported to the 
insect gut. The 5 ' -translational fusion of a B 
thuringiensis 6 -endotoxin with a pharmacologically- 
active insecticidal molecule allows delivery of the 
latter to the insect mid-gut encased in a large, 
fairly insoluble, digestion-resistant protein. The 
Bacillus thuringiens 6 -endotoxin portion of the 
chimeric protein will bind to and disrupt the 
midgut epithelium of susceptible insect larvae, 
allowing delivery of the venom- derived toxin to the 
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insect's haemocoel . The latter moiety may be 
released from the 6 -endotoxin moiety by 
trypsin-like cleavage in the insect gut* This 
provides a faster means for achieving larval death 
and allows venom-derived insecticidal proteins to 
be delivered orally, either by conventional 
application or by expression in transgenic plants. 

Generally, plants infested or liable to 
infestation by insects are treated {eg sprayed) 
with insecticidal compositions as described above 
diluted with a diluent such as water. The 
insecticidal agent is the chimeric fusion protein; 
if desired this may be applied to the plants 
independently of the bacterial, yeast or plant 
expression system that produce it - 

The invention also provides a transgenic plant 
containing DNA encoding a chimeric protein of the 
invention. 

The plant susceptible to insect attack may 
produce the chimeric fusion protein in situ . This 
is achieved by cloning the chimeric gene by known 
means, providing it with a suitable promoter (for 
example the CaMV35S promoter) which will cause 
expression of the gene in plants, and transforming 
the plant by known methods (such as bombardment of 
plant suspension cells with DNA- coated particles or 
Aarobacterium -mediated transformation) . Any 
moncotyledonous or dicotyledonous plant species may 
be transformed with a DNA construct encoding the 
chimeric insecticidal protein . 

SEQ ID NO 1 shows the single- stranded form of 
a synthetic AaHIT gene with flanking regions for 
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cloning. Although this sequence encodes the AaHIT 

insect -specific peptide toxophore from the North 

African scorpion Androctonus australis Hector, the % 

codon usage differs from the naturally-occuring 

gene. The sequence has been altered to optimise 

expression in £ coli and dicotyledonous plants and 

to introduce unique restriction sites into the gene 

or flanking regions. The synthetic gene may be 

used in DNA constructs encoding a chimeric protein 

according to the invention. Furthermore, Example 1 

illustrates a method of creating a trypsin cleavage 

site within the chimeric protein to allow release 

of the AaHIT protein moiety in the insect gut • In 

fact, this is a generally-applicable method for 

creating a trypsin-sensitive cleavage site within a 

chimeric protein to allow release of a protein 

moiety cloned in 3 ' -translational fusion as part of 

the chimeric protein. In the process of 

constructing a DNA sequence encoding the chimeric 

protein (having a first protein moiety in 

translational fusion to a second protein moiety) , a 

sequence encoding an in-frame trypsin-sensitive 

cleavage site is introduced between the first and 

second protein moieties. The second protein moiety 

may thus be released from the fusion by 

trypsin-like enzymes. 

The following examples illustrate the 
invention with reference to the Sequence Listing in 
which: 

SEQ ID NO 1 shows the sequence of a synthetic 
AaHIT gene with flanking regions for cloning; * 

SEQ ID NO 2 shows the amino acid sequence of a 
trypsin recognition site; 

SEQ ID NO 3 shows the sequence of a 162 -mer 
sense strand longmer; 
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SEQ ID NO 4 shows the sequence of a 162-mer 
antisense strand longmer; 

SEQ ID NO 5 shows the sequence of a 34-mer 
sense strand primer; 

SEQ ID NO 6 shows the sequence of a 34-mer 
sense strand primer; 

SEQ ID NO 7 shows a fragment of the wildtype 
CrylA(c) gene sequence starting at base number 
1984; 

SEQ ID NO 8 shows a fragment of the CrylA(c) 
protein sequence from amino acid residues 696 to 
699; 

SEQ ID NO 9 shows a fragment of the 
PCR-modified CrylA(c) gene sequence, starting at 
base number 1984; 

SEQ ID NO 10 shows the sequence of the fusion 
region of the chimeric CrylA(c) -AaHIT DNA 
construct ; 

SEQ ID NO 11 shows the sequence of the fusion 
region of the chimeric CrylA (c) -AaHIT protein. 

EXAMPLE 1 

Construction of a fully synthetic AaHIT gene 

SEQ ID NO 1 shows the single -stranded form of 
a synthetic AaHIT gene with flanking regions for 
cloning. The sequence has been altered to optimise 
expression in E coli and dicotyledonous plants and 
to introduce unique restriction sites into the gene 
or flanking regions . 

In the synthetic gene, the open reading frame 
(ORF) encoding the AaHIT peptide begins with the 
ATG codon at positions 42-44, and terminates at the 
TAA codon at positions 256-258. The ORF contains 
unique recognition sites for the following 
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restriction endonucleases : Prne l (bases 182-189), 
Xhol (bases 205-210) , and Pad (bases 251-258) . 
The doiible- stranded version of SEQ ID NO 1 is 
designed with the following engineering features to 
be used, for example, when the sequence is cloned 
into the E coli vector pUC19 at the EcoRl and 
Hin di II sites of the pUC19 polylinker after the 
vector has had the single Nde l site removed by 
known methods (such as digestion, Klenow polymerase 
fill-in and religation) . 

(a) Method of creating a bltint-ended DNA fragment 
beginning at the ORF start codon 

Digestion with BspMl (recognition site at 
bases 33-38; cleavage at bases 42 on the sense 
strand and 4 6 on the antisense strand) followed by 
filling in of the single -stranded DNA ends using 
Klenow polymerase in the presence of dNTPs produces 
a blunt-end DNA fragment beginning with base 43, 
the first base in the AaHIT ORF. This feature 
allows exact fusion products to be formed when used 
in conjunction with appropriate restriction and 
modifying enzymes to manipulate the 3 ' end of the 
AaHIT gene - 

(b) Method of creating a trypsin cleavage site to 
release the venom-derived insectlcldal protein 
moiety 

The cloned AaHIT gene is digested with Nde l at 
the now-unique Ndel site (bases 40-45) followed by 
filling in of the single-stranded ends with Klenow 
polymerase in the presence of dNTPs. This is 
followed by digestion with the blunt-cutting enzyme 
Nru l at the unique Nru l site (bases 27-32) . 
Re- ligation results in an altered sequence in which 
the pair of bases, 2 8 and 29, are adjacent base 52, 
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thus producing an in- frame Arg codon (CGT) . The 
amino acid sequence with the codon begining with 
the C at position 19 is now in-frame relative to 
the AaHIT ORF and reads "Arg, Gly, Arg, Arg, Met" 
(SEQ ID NO 2), with the Methionine being the start 
codon for AaHIT. This amino acid sequence 
constitutes a trypsin recognition site with 
cleavage between Arg and Met, and allows release of 
the separate AaHIT peptide from the chimeric 
protein by trypsin- like enzymes. 

(c) Method to effect in- frame fusions of genes 
encoding the portions of the chimeric protein 

Cleavage at the unique Eco Nl site (bases 
13-23) is designed to facilitate in-frame fusions 
to the C-terminal portion of any B thurinaiensis 
6-endotoxin protein. It may also be useful to 
incorportate an in- frame polyproline hinge at the 
EcoNl site between the 6 -endotoxin and the AaHIT 
portions of the chimeric gene. Polyproline hinges 
may be of various lengths. Such hinges may 
facilitate correct folding of the AaHIT moiety and 
may provide better trypsin-mediated cleavage and 
release of the AaHIT moiety in the larval gut. 

(d) Construction of a synthetic AaHIT gene with 
flanking DKA sequences using the polyaerase chain 
reaction (PGR) 

Synthetic long deoxyoligonucleotides 
(longmers) and short primer oligonucleotides 
(primers) are constructed as follows: 

a 162-mer sense longmer (SEQ ID NO 3) ; 

a 162 -mer antisense strand longmer (SEQ ID NO 4) ; 

a 34-mer sense strand primer (SEQ ID NO 5) ; 

a 34-mer sense strand primer (SEQ ID NO 6) . 
Longmers were synthesized on an Applied Biosystem 
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model 380B DNA synthesizer on a 40 nmole synthesis 

scale, under conditions in which the coupling 

efficiency was at least 98%. In the case of the 

longer synthetic products (the two antiparallel 

164-iners) , trityis were monitored visually 

throughout the synthesis, and the synthesizer was 

paused during the synthesis to replenish the 

phosphor amidi t es . Absorbance at ^^^^ was used to 

A^Dunm 

estimate oligomer concentrations. Longmers and 
primers were deprotected at 60 ®C for 3 9 or 15 hours 
respectively, prior to f reeze-drying and storage at 
-20®C, Further purification of longmers was not 
required. Each longmer synthesis results in a 
mixture of molecules of varying lengths, all of 
which have identical 3' ends. The full length 
164 -mer sense and antisense oligomers possess a 5o 
base pair overlap region. 

Synthetic longmers were resuspended in 
sterile, deionized, glass-distilled water to a 
final concentration of 2 fig/fil, aliquoted and 
stored at -20°C. Synthetic primers were 
resuspented in sterile, deionized, glass-distilled 
water to a final concentration of O.l^g/^1, 
aliquoted and stored at -20**C. 

Polymerase chain reactions were done in two 
"rounds". Round 1 provided core gene synthesis ( 
that is, extension of longmers in the absence of 
the flanking primers) . Round 1 consisted of a 5' 
denaturation period at 94<>C prior to addition of 
the TlmpliTaq polymerase and 10 cycles of: 

melt 1' at 94 °C 

anneal 2' at 65 ®C 

extend 1' at 72«»C. 
Round 2 provided core gene amplification in the 
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presence of the flanking primers. For Round 2, 
1/25 of Round 1 reaction was added to a new 
reaction mix prior to 30 cycles of: 

melt 1' at 94*>C 

anneal 2' at 65**C 

extend 1' at 72«>C. 
Reaction mixes were composed as follows: 
Round 1 - Core Gene Extension (final 
concentrations, 50/il reactions) : 

50mM KCl 

lOmM Tris ph 8,3 at room temperature 
2.5mM MgClj 

0,25mM each dNTP 
TM 

2 .5U AmpliTaq 

A fig each longmer (sense and antisense) 
Round 2 - Core Gene Amplification (final 
concentrations, 50/il reactions) : 

SOmM KCl 

lOmM Tris ph 8.3 at room temperature 
2.5mM MgCl^ 
0.25mM each dNTP 
2.5U AmpliTaq™ 

1/25 volume Round 1 reaction mix 

200ng each flanking primer 
PCR products of the correct size (274 base pairs; 
for example, corresponding to the double -stranded 
version of the sequence presented in SEQ ID NOl) 
were observed by electrophoresis on 2% agarose 
gels. The synthetic PCR product was cloned using 
known methods . 
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EXAMPLE 2 

Construct! n of a chimeric DMA c nstruct 
encoding a CryZA(c) -AaHIT Insectlcidal fusion 
protein 

A suitable CrylA(c) gene fragment may be 
isolated as a 6.6 kilobase Hin di II fragment from 
the DNA of recombinant £ coli strain NCIB 40211 
obtained from the National Collection of Industrial 
and Marine Bacteria (23 St Machar Drive, Aberdeen, 
Scotland AB2 IRY) . The crylACc) gene is then 
engineered as an in-frame 5' -translational fusion 
to the amino terminus of the AaHIT gene at any 
point dovmstream of the natural trypsin- sensitive 
site between amino acid residues 622 and 623 of 
CrylA(c) . DNA sequences added between the 
CrylA(c)- and AaHIT encoding portions may include 
those encoding a polyproline hinge and may also 
include those which add a trypsin-sensitive site. 
One method for constructing this fusion involves 
production of a synthetic C-terminal region for 
crv IA(c) by known polymerase chain reaction (PCR) 
methods such that an insectlcidal gene product is 
encoded by the crv IA(c) -derived sequence. A 
variety of possible lengths exist for the 
cry lA(c) -derived moiety. 

As an example, this method was used to alter 
the CrylA(c) coding sequence for amino acids 696 to 
6 98 such that the natural protein sequence was 
maintained but an EcoNl site was created in the DNA 
sequence as shown below. The following sequence is 
from the wildtype CrylA(c) starting at base number 
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1984 (SEQ ID NO 7) and encodes amino acid residues 
696 to 699 (SEQ ID NO 8) : 

5'..,C C A G A A C G T G G G.,.3' 
...GGTCTTGCACCC ... 
Pro Glu Arg Gly 

The following sequence is from the PCR-modified 
CrylA(c) starting at base number 1984 (SEQ ID NO 9) 
and encoding amino acid residues 696 to 699 (SEQ ID 
NO 8) . The PCR-altered bases are shown in bold and 
marked by the symbol ** . The EcoN l site is 
underlined and the cleavage sites are indicated by 
the symbols t and ^ . 

i 

5' . . . cCT O GAACGA <=> GG G. . .3' 
...GGA CTTGCT CCC... 
t 

Pro Glu Arg Gly 

Digestion of the altered cry lA(c) gene with Eco Nl 
then allowed construction of a chimeric gene by 
altering the AaHIT gene as described above to 
create a site encoding a trypsin-sensitive peptide 
bond, prior to digestion with Eco Nl and ligation to 
the digested cri:IA(c) gene by known methods. This 
results in a CrylA(c) -AaHIT chimeric protein, the 
fusion region of which is shown below. The 
following sequence is the sense strand (SEQ ID NO 
10) encoding the fusion region of the chimeric 
CrylA(c) -AaHIT protein (SEQ ID NO 11). Sequences 
derived from the B thurinqiensis CrylA(c) gene are 
shown in bold (five bases and two amino acids at 
the 5' end) . The remaining sequence derives from 
DNA SEQ ID NO 1, after manipulation as described 
above. The asterisk (*) indicates the 
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trypsin-sensitive cleavage site. Met is the start 
codon for the AaHIT gene. 

5'...CCTGAACGAGGACGTCGTATG.,.3' 
. . . Pro Glu Arg Gly Arg Arg Met . • . 

EXAMPLE 3 

Construction o£ a chimeric DNA construct 
encoding a CryV-AaHIT insecticidal fusion protein 

A suitable cryV gene may be isolated the DNA 
of recombinant E coli strain NCIB 40278 or from the 
DNA of S thurinaiensis strain NCIB 40091 obtained 
from the National Collections of Industrial and 
Marine Bacteria (23 St Machar Drive, Aberdeen, 
Scotland AB2 IRY) . The cryV gene is then 
engineered as an in-frame 5' -translational fusion 
to the amino terminus of the AaHIT gene at any 
point downstream of CryV amino residue 648. DNA 
sequences added between the CryV- and 
AaHIT-encoding portions may include those encoding 
a polyproline hinge and may also include those 
which add a trypsin- sensitive site. One method for 
constructing this fusion involves production of a 
synthetic C- terminal region for cryV by known 
polymerase chain reaction (PCR) methods such that 
an insecticidal gene product is encoded by the 
cryV-derived sequence. A variety of possible 
lengths exist for the cryV- derived moiety. This 
provides an in- frame cloning site anywhere from 
CryV amino acid residue 648 to residue 720, and an 
encoded trypsin cleavage site just upstream of the 
AaHIT initiation codon. 


wo 95/11305 


19 


PCT/GB94/02274 


EXAMPLE 4 

Construction of a chimeric DNA construct 
encoding a CrylllA-AaHIT ineecticidal fusion 
protein 

A suitable cry lllA gene may be isolated as a 
3.0 kilobase DNA frament of Hin di I I - digested 
genomic DNA from g thurinaiensis strain NCIB 4 0023 
obtained from the National Collections of 
Industrial and Marine Bacteria (23 St Machar Drive, 
Aberdeen, Scotland AB2 IRY) . The crv IIIA gene is 
then engineered as an in-frame 5 ' -translational 
fusion to the amino terminus of the AaHIT. DNA 
sequences added between the CrylllA- and 
AaHIT-encoding portions may include those encoding 
a polyproline hinge and may also include those 
which add a trypsin-sensitive site. One method for 
constructing this fusion involves production of a 
synthetic C-terminal region for cry lllA by known 
polymerase chain reaction (PGR) methods such that 
an insecticidal gene product is encoded by the 
cryl I lA- derived sequence, A variety of possible 
lengths exist for the cryl I lA- derived moiety. This 
provides an in- frame cloning site in the CrylllA 
coding sequence and an encoded trypsin cleavage 
site just upstream of the AaHIT initiation codon. 
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EXAMPLE 4 

Construction of a chimeric ONA construct 
encoding a CryZZIA*AaHIT insecticidal fusion 
protein 

A suitable cry lllA gene may be isolated as a 
3.0 kilobase DNA frament of gindlll -digested 
genomic DNA from B thurinaiensis strain NCIB 40023 
obtained from the National Collections of 
Industrial and Marine Bacteria (23 St Machar Drive, 
Aberdeen, Scotland AB2 IRY) . The crv IIIA gene is 
then engineered as an in-frame 5' -translational 
fusion to the amino terminus of the AaHIT. DNA 
sequences added between the CrylllA- and 
AaHIT-encoding portions may include those encoding 
a polyproline hinge and may also include those 
which add a trypsin-sensitive site. One method for 
constructing this fusion involves production of a 
synthetic C- terminal region for crv IIIA by known 
polymerase chain reaction (PGR) methods such that 
an insecticidal gene product is encoded by the 
cryl I lA- derived sequence. A variety of possible 
lengths exist for the cryl I lA- derived moiety. This 
provides an in- frame cloning site in the CrylllA 
coding sequence and an encoded trypsin cleavage 
site just upstream of the AaHIT initiation codon. 
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SKQXJENCE LISTING 


(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ZENECA LIMITED 

(B) STREET; 15 STANHOPE GATE 

(C) CITY: LONDON 

(D) STATE: ENGLAND 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP) : WIY 6LN 

(ii) TITLE OF INVENTION: INSECTICIDAL PROTEINS 
(iii) NUMBER OF SEQUENCES: 11 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
v(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1,25 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9321469.0 

(B) FILING DATE: 18-OCT-1993 


(2) INFORMATION FOR SEQ ID NO: 1: 

{1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGAATTCCCG GGCCTGAACG AGGACGTCGC GAACCTGCGC ATATGAAAAA AAACGGTTAC 60 

GCAGTTGATT CTTCTGGTAA AGCGCCAGAA TGTTTGTTGT CTAACTACTG TAACAACCAA 120 

TGTACCAAAG TTCACTACGC TGATAAAGGC TACTGTTGTT TGTTGTCTTG TTACTGTTTT 180 

GGTTTAAACG ATGATAAAAA AGTTCTCGAG ATTTCTGATA CCCGTAAATC TTACTGTGAT 240 

ACCACCATTA TTAATTAAGA GCTCAAGCTT GGGC 274 
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AGTAGCCTTT ATCAGCGTAG TGAACTTTGG TACATTGGTT GT 
(2) INFORMATION FOR SEQ ID NO: 5: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 


162 


(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGAATTCCCG GGCCTGAACG AGGACGTCGC GAAC 
(2) INFORMATION FOR SEQ ID NO: 6: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


34 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6; 
GCCCAAGCTT GAGCTCTTAA TTAATAATGG TGGT 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid ^ 

(C) STRANDEDNESS: single [ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA ] 


34 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CCAGAACGTG GG 

(2) INFORMATION FOR SEQ ID NO : 8; 

(i) SEQUENCE CHARACTERISTICS: 


12 
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(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 8: 

Pro Glu Arg Gly 
1 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQXIENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCTGAACGAG GG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCTGAACGAG GACGTCGTAT G 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 


(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Glu Arg Gly Arg Arg Met 
1 5 
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CLAIMS 

1. A chimeric protein comprising an N-terminal 
portion fused to a C-terminal portion, the 
N-terminal portion being at least part of a 
Bacillus thurinaiensis 6 -endotoxin including 
the a-helical N-terminal domain and the gut 
epithelial cell recognition domain, and the 
C-terminal portion being a venom-derived 
insecticidal protein. 

2 . A chimeric protein as claimed in claim 1 in 
which the venom-derived insecticidal protein 
is an AaHIT peptide obtainable from 
Androctonus australis Hector. 

3. A DNA construct capable of expressing a 
chimeric protein comprising a DNA sequence 
encoding at least part of a Bacillus 
thurinaiensis 6 -endotoxin including the 
a-helical N-terminal domain and the gut 
epithelial cell recognition portion in 5'- 
translational fusion to a venom-derived 
insecticidal protein. 

4 . A DNA construct as claimed in claim 3 in which 
the venom-derived insecticidal protein is an 
AaHIT peptide obtainable from Androctonus 
australis Hector . 

5 . A DNA construct as claimed in claim 4 in which 
the sequence encoding the venom- derived 
insecticidal protein is SEQ ID NO 1. 
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A method to combat insects comprising exposure 
of the insects to a chimeric protein as 
claimed in claim !• 

A transgenic plant comprising a DNA construct 
as claimed in claim 3 . 

A method to create a trypsin- sensitive 
cleavage site within a chimeric protein, said 
chimeric protein having a first protein moiety 
in translational fusion to a second protein 
moiety, which comprises cloning a DNA sequence 
encoding the chimeric protein and introducing 
a region encoding an in- frame 
trypsin- sensitive cleavage site between the 
first and second protein moieties such that, 
when the encoded chimeric protein is 
expressed, the second protein moiety is 
releasable from the fusion by trypsin- like 
enzymes . 
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1) Chimeric gene for conferring on plants increased tolerance to a herbicide against EPSPS. 2) tt comprises, 
in the direction of transcription, a promoter region, a transit peptide region, a sequence encoding tolerance to 
glyphosate and a polyadenylatton signal region, characterised in that the transit peptide region comprises, in 
the direction of translation, at least one transit peptide from a plant gene encoding a plastid-localised 
enzyme, a portion of the sequence of the mature N-terminal portion of a plant gene encoding a 
plastid-localised enzyme, then a second transit peptide from a plant gene encoding a plastid-localised 
enzyme. 3) Production of glyphosate-tolerant plants. 
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